The Art of Data Augmentation
نویسندگان
چکیده
The term data augmentation refers to methods for constructing iterative optimization or sampling algorithms via the introductionof unobserved data or latent variables. For deterministic algorithms, the method was popularized in the general statistical community by the seminal article by Dempster, Laird, and Rubin on the EM algorithm for maximizing a likelihood function or, more generally, a posterior density. For stochastic algorithms, the method was popularized in the statistical literature by Tanner and Wong’s Data Augmentation algorithmfor posterior sampling and in the physics literatureby Swendsen andWang’s algorithm for sampling from the Ising and Potts models and their generalizations; in the physics literature, the method of data augmentationis referred to as the method of auxiliary variables. Data augmentationschemes were used by Tanner and Wong to make simulation feasible and simple, while auxiliary variables were adopted by Swendsen and Wang to improve the speedof iterativesimulation.In general,however, constructingdata augmentation schemes that result in both simple and fast algorithms is a matter of art in that successful strategiesvary greatlywith the (observed-data)models being considered.After an overview of data augmentation/auxiliaryvariablesand some recent developmentsinmethods for constructingsuchef cientdataaugmentationschemes,we introduceaneffectivesearchstrategy that combines the ideas of marginal augmentationand conditionalaugmentation, together with a deterministic approximationmethod for selecting good augmentation schemes. We then apply this strategy to three common classes of models (speci cally, multivariate t, probit regression,and mixed-effectsmodels) to obtain ef cient Markov chainMonte Carlo algorithms for posterior sampling. We provide theoretical and empirical evidence that the resulting algorithms, while requiring similar programming effort, can show dramatic improvement over the Gibbs samplers commonly used for these models in practice. A key featureof all thesenewalgorithmsis that they arepositiverecurrentsubchainsof nonpositive recurrentMarkov chains constructed in larger spaces.
منابع مشابه
Data Augmentation for Visual Question Answering
Data augmentation is widely used to train deep neural networks for image classification tasks. Simply flipping images can help learning by increasing the number of training images by a factor of two. However, data augmentation in natural language processing is much less studied. Here, we describe two methods for data augmentation for Visual Question Answering (VQA). The first uses existing sema...
متن کاملExploring Data Augmentation for Improved Singing Voice Detection with Neural Networks
In computer vision, state-of-the-art object recognition systems rely on label-preserving image transformations such as scaling and rotation to augment the training datasets. The additional training examples help the system to learn invariances that are difficult to build into the model, and improve generalization to unseen data. To the best of our knowledge, this approach has not been systemati...
متن کاملEfficient Gaussian Process ClassificationUsing Pólya-Gamma Data Augmentation
We propose an efficient stochastic variational approach to GP classification building on PólyaGamma data augmentation and inducing points, which is based on closed-form updates of natural gradients. We evaluate the algorithm on realworld datasets containing up to 11 million data points and demonstrate that it is up to three orders of magnitude faster than the state-of-the-art while being compet...
متن کاملComparing the Effect of Reamed Exchange Nailing and Augmentation Compression Plating in Treatment of Femoral Shaft Nonunion
Background and purpose: One of the most important complications of bone fractures is nonunion. This research aimed at comparing the effect of exchange nailing and plate augmentation on complications after surgery and the time to achieve desired outcomes. Materials and methods: This descriptive-analytical study was conducted in 12 femoral shaft nonunion cases treated operatively in 2011-2018 (...
متن کاملImproving Salient Object Subitizing
In this study, we investigate the work of [1] on Salient Object Subitizing — the ability to quickly detect a small number of objects in an image. We firstly apply the method described by [1] with a bigger model to show that performance is improved. We than look at how data augmentation and additional data affects our system. We improve the (recently established) state-of-the-art in Salient Obje...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001